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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AGAAATC AAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
40 1 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
45 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGCA GTATTTCTTT CAAG ACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
60 1 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
65 1 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATC ACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1 00 1 CTTGAG AAAG TCAAAGATTT TAACTTC AAC TACTATTTAA C AAAATCTTG 
1 05 1 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AA TTTTTC TA 
1 20 1 CAATATACTC CCC A AAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
125 1 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1 40 1 ACGTCTTATG GAAATTGCTC CGATGG ATAT TCGAGGATCT CGTCGTCTCG 
1 45 1 CTGATTAGAT GATTTTTCTA TGTC ACCGAG CAACAGAAAA GTTACTCCAA 
1 50 1 AACCTATTAC TACAGAAAGA ATATTTGGG A CGTCATTATG AAAATGTCAA 
1 55 1 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGG ATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTC AAT AAGAAGATTG 
1 70 1 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
195 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
200 1 CTGCACAAAT TCTAAAGAGA A AG AATAAC A TAGTTATCGA TTCGAAAAAC 
205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAA AACAT TAATTGTTGA AGCAAAGCAA 
220 1 AG AAATTATT TTAAG AAAGA TAACTTACTT C AACCAGTCA TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
255 1 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TCAATTG ATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT C A AG A AG AAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
285 1 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT G ACCAATGTC AATCTCTAAT TC AATATG AT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDDCIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNN1SAMD RAQT1FTNIF RFNRIRKKLK DKVIEK1AYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENrYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKJKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
1 0 1 TTTGG ATG AT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
1 5 1 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
25 1 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA C ATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
65 1 TTGAGAC AAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
90 1 TTAC AACAGA TTACCTGTTT TG ATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CaAAAATTCC 
1 101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 C AC AGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAG AAATGG A GCCG A AATCT TAATCAAAAA GAATTGCGTC G ATATTGCAA 
1401 AAGAATCGAA CTCTA AATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1 551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
165 1 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 ♦ - — * + ♦ 60 

GGCXnTTTGGGGTTTTGGGGTTTTGGGGATATTTTT^ 

a PQNPKTPKPL * KKK KLR * FR 

b PKTPKPQNPY KKR KNCGSLE 

c PKPONPKTPIKKEKIE.VV-K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 - ♦ -* 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTAT 

y : a 'NXZLFPHKWRWILIWHI.*XI 

b IXYYSRTNGDGYCFGC*Y.RXF- 
W" C * N I I ? A 0 M E M D I D V D D I * E N L - 

?5 T ACTTCCT AAT AC ATTC AAC AAGT AT AGC AGC TCTTG T AGTGA C AAG AAAGG ATG C AAAA 

tfi 121 - * * * * * 180 

jhj ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTAC 

"Si a YFLIHSTS I AALVVTRKDAK 

|f! b TS*YIQQV-QLL"'QERMQN- 

c lpntfnkyssscsdkkockt- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAA^ 

181 <► * 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 



- a HCNLARKR LHC LFQ SCKNN 

N: b IEIWLEIAF I DYS KVAKTIR- 

"pf- : C LKSCSKSPSLTIPKLQKQLE- 

Q AG TTCT ACTTCTC GG ATG C AAATC TTT AT AACG ATTC TTTCTTG AG AAAATT AGTTTT AA 

S] 241 - * 300 

S M TCAAGATGAAGAGCCTACGTTT AG AAATATTGCTAAGAAAGAACTCTTTTAATC AAAATT 

a SSTSRMQI F IT I LSCEN* F • 

b VLLLCCXSL • RFFLEK ISFK 

C FYFSDANLYNDS T L R KLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY * C L N K I R * 

b KRRAKSRNCNITNV* IKSGN- 

C SGEQRVSIETLLMFK * N Q V M - 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 42C 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a CCLFYFLDHFLRS IMEKIT* 

b EOYSIF" ITS "GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 - - ♦ 480 

ATG ATTTTCC ATTTGTC AAACCT AAT AAAC^G ATCGGTTGTT ACT A CTC AT AT AATTTAA 

a Y'KVNSLDYFPSOOCCVY'I 

b TKR'TVWI IS LANNOEY I KF 

c LKCXOFCLFP'PTHHSILNS- 
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CATATGAGAATCAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 



HMRMSQR I S I HQTYQRQTRY 
b ICECVKGSRY I RLTKD KL A I 

c 



Y ENESKDLD.TS DLPKTNSL 
AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 
TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGA 



KTQEKVC ~SN SRRTYCIYYS 
b KRKKKFDNRTAEELIAFTIR- 
c NARKSLIIEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTCTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 - - * ♦ ** * 6€ 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 



YGFYYNC FRY R RCTPESCDN*- 
b MGFITIVLGI DGELPSLET I 

C WVLLQLF-VSTVNSRVLRQL- 

TG AAAAAGC TGTTT AC AACTG AAGG AATCGC AGTTCTG AAAGTTCTG ATGTGTATGCCAT 

U- 661 720 

m ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

111 a CKSCLOLKESQFCKF'CVCH 
bib EKAVYNCRNRSSESSDVYAI 
J^jc KKLFTTEG I AVLKVLMCMPL- 

I f\ TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGC^ATAGAAACAAA 

?J 721 - " — — — 7 *0 

pi AT AAAAC AC TT AATTAG AGTTT AT AG AAT AG AGTT AAATT ACCT ATCGAT ATCTTTGTTT 

1. a YFVN * S Q I SY LNLMDSYRNK 

Ob ILCINLKYLISI WIAIETN- 

UC FCELISNILSQFNG-L-KOT- 

CCAAATAAACCATCCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

^ 781 * * * 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 



PNXPCKFNCIYVKSFCTKAH 
SU b QINHASLMEYTLN PLGQMHT 

c K ' T v . 0 V * W N I R " I LWDKCTL- 

TG AATTT AT ATTGG ATTC TT AAAGC ATAG AT AC AC AG AATGCTTT AG AG ACTG ATTTAGC 

841 900 

ACTTSAATA 7 A.ACCT AACAATTTCCTATCTATCTCTCTT ACGAAATCTCTGACTAAATCG 

a CIYICFIKHRYTECFRDCFS 

b ErI LDS * S IDTQNALETO LA 

c NLr--!L KA-rKRML*RLI-L- 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 - - 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQiTCFDYSCSSLISLKEA 

b YNRLPVLITL AHLLYL'KKQ- 

c TTDYLFCLLLLISY I FKRSR- 

GCCCAAATGAAAACAACACTAAACAAACACATTTCAAAATTTGTTCA 

961 ♦ 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a -CEHKRRLKKE 1 SKFVOSSVT 

b AKCKED'RKR F Q N L L I L L * P 

c R N E K K ? X E R D FK I CC FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 - 1080 

CCTTAATTGTTCTTCTTATA.ATCCTTCCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a c!KrJKN:»NEKEEELSOSCF 

b ELTTR ILATK KKK SYHNPDS 

C K-00EY'O?KRaRAITILIL- 
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TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

10 81 ♦ ♦ 

AATTTCTAAAGTTTTTAACGTCCATTCTCTCTA 

a LKISKIPGKRDTFI K IH I L 

b * RFQKFQVRE IHSLKFI YYS 

c KDFKNSR * E R Y IH * N S Y I I V - 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAAT 

1141 * ♦ — * 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFI LTI FFD * L E 

b FSFHSCYFLLS * Q Y F L I S W K - 

c FHFTAVI FFYLNNI FCLAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 * * ♦ * * * 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAG 

a VKSIK'EKR • T E V T * LIH IH 

b •KVSNKRSARLR * LSLFTF I 

c KKYQIREALDCGNLAYSHS • - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 
Pi 1261 -- * * - 1320 

:J tctagctggaagtatataggttatgctactattcctttgtcgtcagtaggcaaaattttt 

if \ a RSTFIYPIRC'GNSSHPF'K 

'f[ b DRPSYIQYDDKETAV I RFKN- 

Ijj c IDLHISNTMIRKQOSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

111 1321 -- * * *~ 1380 

m ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a •CYED-IFRVKKWSRNLNQK 
l&'b SAMRTKFLESRNGAE I L IKK 

M c VLCGLNF * SQEMEPKS • SKR- 

! 7 GAATTGCCTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 1440 

ff| CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

Q a ELR3YCKR I EL." I FR * * V L P 

n;b NCVDIAKESNSKSFVNKYYO- 
c IASILQXNRTLNLSLI5ITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 - 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACCTGTCTTCTAGTAATTTCTTTATTT 

a I LIDCRO " RCNCT EDH " RNK 

b SCLIEEIOEATAQKI I X E ! K - 

c LDCuKRLTRQLHRRSLKK • S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 1560 

CATTCAAAATAJVTTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN-RIN - ITN I Z I SDLQ 

b tLLIRE'TKLLE "RSAIFN- 

C NFY'LENKLNY" YROQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 - - 1620 

AACTGCTTTATTTTCGACTTCATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK'KLN • S • TIKNTNLCQN 

b CRNXSCTKVRQ* K IQTLVK I 

c DEIXAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAACGCAATAAATAAAATGA 

1621 - 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 

a IEEGXEOQLAKEK IRQ * ! KC 

b LRKEKKTS*QKKK'CNX-NE- 

C CCRXRRPVSKRKNXAtNKHS- 
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G T AC AG AA GTG AAG AAAT AAAAG ATTT ATTTTTTTC AAT AATTT ATTG AAAAG AGGGGTT 
C ATGTC TTC ACTTCTTT ATTTTCT AAAT AAAAAAAGTTATT AAATAACTTTTCTCCCC AA 



a vQKCRNKRFIFFtfNLLKRCV 

b YRSEEIKOLrFSXIYCKEGF- 

c TtVKK • KI YF FQ * FI EKRCF- 

TTGGCGTTTTGGGGTTTTGGGG 

174 1 - ----- 1762 

AACCCCAAAACCCCAAAACCCC 

a LCFWGFG 

b WCFGVLG- 

C G V L G F W 
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2 EVDVDNOAONHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNOSQSHYKDL 51 

|:- i -.::|: =1 |. = :| ::- • I l-l-LI 

19 ELE1EMQENQNDIQVRVK I DDPKQY . . LVNVTAACLLQEGSYYQDK 62 

S2 EDIXrr AOTNIVATPRDYNEEDFKVIARKEVF.STGLMIELIDKCLVELL 100 

-U ..(.. ,|: ..:| | ...::(.: 
63 DERHY: ITKALL . . . EVAESDPEFICQLAVYIRNELYIRTTTNYI VAF . 107 

!G1 S S S D7S DRQKLQC FG FQLKGNQLAXTHLLTALSTQKQYFFQDEWNQ VRAM ISO 

108 CWHKNTOPFIEKYFNKAVLLPNDLLEVCEFAOVLYI 144 

LSI IGNELrKHLYTXYLIFORTSEGTLVOFCGNNVFDHLKVNDKFDKKOKGGA 200 

145 FDA7EFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

:.!; .::|.: . | ... ::J. . | | :. | . : 
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221 .RKENXLTKLOAIKESEDKSKRETG D I MNVED A I KALK P A VM K K I 264 
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FIGURE 19 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1 141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca-aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcatlagtc cttggtttga tggtaaaata 
1 74 1 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 
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FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact.gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
f*& 901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 

p 961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 

O 1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
U'\ 1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
y j 1 141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 

1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
jfj 1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
^- 1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
!L : 1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
J"f 1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
r|* 1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
s m 1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
p5 1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
SI 1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1 80 1 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatactttt aagtagaacg aatmaatt taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaarttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcanatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaaicg 
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ETDYDTEKWEISHDQK>IYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAE 
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QYNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKiVDTNKVQDYFKFLQEFPRLTHVSQQAIPVSATNAVENLNVLLKKViaiANLNL 

tj VSIPTQF>IFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNLKPLRLNFYTY 

9 VAQETSRKQlLKQATTIKNLK>WKhlQEETPETKI)ETPSESTSGMKFFD 

S FSVNLQATQEIYDSLHKLLIRSTNLKKFKLSYKYEME 

|j RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAK^ 

p SSSLESLEDIDSLCKSIASCIGMLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 
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LCLPTGTYYDYNSDRW 
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MKILFEFIQDKLDIDLQTNSTYKLENLKCGHFNGLDEILTTCFAL 

PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARNEDVNNSLFCHS 

ANVNVTLLKGAAWKMFHSLVGTYAFVDLLINYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSS1KKLTDLR 

EAIFPTNLVKJPQPJ.KVPJNLTLQKI.LKJIHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFIIVILQKLLPQEMFGS1GCNKGIGIKNLNLLLSLPLNGYLPFDSLLKKL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VWRHDTWhnCLITPFIVEYFKTYLVEhn^CRNHNSYTLSNFNH 

IIAIPCRGADEEEFTIYKENHKWAIQPTQKlLEYLRNKPvPTSFTKrYSPTQIADRJKE 

FKQRLLKKFN1WLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TNTGVLKLFhP/VNASRVPKPYELYIDNVRTVH^ 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

INIKJ<JLAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCAMmFVKELEVWKHSSTMN 

NFH1RSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSrNVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNHLLRKEIQHLQAYIYIYIHIVN 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LC VS S I LSS F Y Y ATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

AKPLHWLHSVYWELLRSFFYVTETTFQKHR 



I8EIEWLVLGKR6NAXMCL6DFEKRKQIFAEFIYWLYNSFIIPILQSFFYITB6SDLRNR 

LKDFRWLFISD IWFTKHHFENLNCUIICFISWLPRQLIPKI IQTFFYCTEISSTVT- 

TREI5WMQVET-8AKHPYYFDKEN-1YVLWKLLRWIFBDLWSLIRCFFYVTEQ0KSYSK 



LFFYRKSWSKL5SI0IRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRFIPKP — DOL 
TVYFRKDI*KLLCRPFI -TSMKMEAFEXI NEHNVRKDTQK-TTLPPAV I RLLPKK — OTP 
IVYFRHDTWNKLITPFIVEYFKTYLVEKKVCRKHNSYTLS--HFNHSKHRIIPKK5NNEF 
T Y Y YRKN IWDV IHKMS I -ADIrKKETLAEVQBXEVEEWKKS -IrGFAPGKLRL I PXK — TTF 



Motif 2 

RPIVNMDYWGARTFRREKRAERLTSRVKALF-8VLMYERA 

RLX TN-LRKRFL I KHOS HKKMLV8TNQTLRPVAS II-KHLiI UEESSGI PFKLEVYMKLLTF 
Rl I A I PCRGADEEEFT I YK ENHKtfAI QPTQKILEYLRNKRPTS FTK I YSPT$I ADRI KEF 
RPIMTFKKKIVNEDRKTTKLTTNTKIXH6HLMLKTLKH-RMFKDPPGPAVFHYDDVMKKY 



KKDLLKHRMFGR-KKYFVRIDIKSCYDRIKQDLMFRIVKK^KLKDPEPVIRKYATIHATS 
KQRLLKKFNKVLPELYFMKFDVKSCYDSIPRMECMRILKD^ALKNENGFFVRSQYFFMTM 
EBFVCKWKQVGQPKLFFATMDIEKCYD6VNREKLSTFLKTTXLLSSDFWIMTAQILXRKN 



Motif 1 



* 



* 



Motif 3 (A) 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIGURE 27 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 



FIGURE 28 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTT ATGTC ACGG AG ACCACGTTTCAAAAG AAC AGGCTC 1 1 1 I 1CTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 



FIGURE 29 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSN1CERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLE1IGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 

RTIETSITQNKSARXEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHR1SLSKVYNHYCPYIDTHDDEK1LS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIK1SEIEWLVL 

GKJISNAKT^CLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKIWENNVRMDTQKTTLPPAVIRLLPKXNTFRLITNLRKRFLIKMGSNKKMLVSTNQT 

LRPVASILKHLrNEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVK1GNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKJGDAKXFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRR1AD 



I 




FIGURE 30 

ggtaccgatttacraccmcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatattraacaagaactcaataacaataccaagtcaaanccaatatgaa 
tgttattagtgatcgataatamctattttatcggtcgttaccaagtate^ 

ggncgcttactmaatcgtggtactgtmagctgctecttctagccaaccgcgtgtttctaccccgtcattggatatagctcttggagtagctcacagaaate 
ctgatgagactatattagancattacagtccgtgcatattcttaacatggagccttaractttegatgagtcacgtcgcatgatggagtamggtatcatccaacgtttgccng 
aaaaggttgataanamgcaaaatcatgtccttagjggtggtaat^ 
ctccttmcgcggttttattmmtmctattctcatg^ 

aataatcuaanagtncgcttataangatagtagtagaaagattggtgattctactcgtgtaatgttattagmaaagatacmgcaaaacatttattag 
aaaatcctataattataaatattaatcaatamgcggtcactatttatttaaaacgttatgat^ 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 
CCTTAAATGATTATGTACAACTTGTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 
CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 
AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatatttttgttttgatttttttctattcg 
ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 
ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatatttacctgcaattactgmcaaagaga ngtattta accgataaagAA 
TCATGAAGATTTTCGAGCCATGCATGTAAACGGA GTACA AAATGATCTCGTTTCTACTTTTCCTAATTA 
CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgttgcgcactttgaaca 
agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 
CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 
I CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 
• CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 
! aactaatactgttatccncataacmtttmgATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 
- TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 
1 AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 
1 ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 
' CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 
CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 
S tattgtataaaamattaccactaacgattttaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTCTTTTAGTTTAC 
ATTATTTAATGAGTAACATAAAGgmtatgccaaattttmaccattaattaacaatcagATTTCAGAAATTGAATGGCTAGT 
CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGA AACGCAAG CAAATATTTGCGG 
i A ATTC ATCTACTGGCTATACA ATTCGTTTATA ATACCTATTTTACA ATC 1 1 1 III 1 1 ATATCACTGAATC 
1 AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 
' TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtannaaagtattttttgcaaaaagctaatattncagAACAA 
TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 
CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaatttttggtcatcaatgtacmacttctaatctattattag 
cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 
AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 
TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaatmgcagGCGTAAGAAG 
TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 
AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgm^ 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgttgaagGATTTGAGAA 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATACTTTT7TTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGA AATC 1 " ! ' I 1 I I TTACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttattttaactagaaaagtcattaattaaccttagATC I I I TGAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGT7TCTTGTCCTCTGCAGAAGTCAA 



FIGURE 30 (cont) 



ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTlTTlTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAAT^ 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagcaagcatact^ 

atccttatactmaagaaagattgacagtggttgctgactactgcccacatgcccatta^ 

aataaggaaagtggtmctataatgaataatgcccgcactaatgcaaaaagacgaagattatctt^^ 

acccagtgngrtgaagaaagcaaggataamggaacaagcttctgcagatgacaggctaaatmggtgaccgaatmggtaaaagccccaggttatccatggtgg 
gccttgctactgagacgaaaagaaactaaggatagmgaatactaatagctcam 

gtgttaagccattattggartccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgatttaaaggag 
gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcangggagacatctcttgatgaatcagatgcgga 
gagtatctccagcggatccngatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 



« % 



FIGURE 31 



EST2 pep 

Euplotes pep 

Trans of Cetrahymen 

Consensus 



FFYCTEISST VTIVYFRHDT WN KLIT P FIVE YFK-TYLVEN 

FFYVTEQQKS YSKTYYYHKN IWDVT-MKMS IAD LKK ETLA — EVQE 

KHKE GSQIFYYRXP XWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 

FFY.TE..K. .S. . YYYRK. IW...-KL F. -K V. . 



40 
43 
44 

50 



EST2 pep 

Euplotes pep 

Trans of cetrahymen 

Consensus 



NVCRNHNSY- TLSNFNHSjKjjl^ 

KEVKEWKKSL GFAPC 

KIQLEEENLE KVEEKLIPED SFQKYPQC&p 

K. . .E F . .<M£% 




FBIIMPCRG 
FK F I tfTFNKK 
j&FItfTFLRK 

EiayiMTF.RK 



79 
78 
92 

100 



MEST2 pep 
QEuplotes pep 
STrans of tetrahymen 

|jr]Consensus 



ADEEBFTIYK ENHKNAIQP^VQKILE^tRNK 
rVNSDRKTTK LTT^^^U^|^^MrJCIi^- 
DKQKNIK IJWLNQILMDS^LVF: 

K. .K LN.N. .L. .S -QL.L. .UQi- 



RPTSFTKIYS FTQIADRIKE 

RMFK -DPFGFAVFN 

- ML-G -QXIGYSVFD 

. . . . .IG. -VF". 



129 
120 
130 

150 



\|EST2 pep 

If {Euplotes pep 

ffiTrans of tetrahymen 

* Consensus 



FKQFO,U<fi^N 



YD- 
NK-QIi 



.pg 



NVL 

EFVCKWKQVG C 
•A QFIEKWKNKG F 



K-. . .HKF. .F. .KWK. .G JM. 



fFMKFD VKSCYD 
FATMD IEKCYD 
fYVTL- 



157 
155 
158 

186 



FIGURE 32 



S-l: FFY VTE TTF QKN RLF FYR KSV WSK 
S-2 : RQH LKR VQL RD V SEA EVR QHR EA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AiCF LHW LMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FTP KPD GLR PIV NMD YW 
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Vector 



Genomic DNA Insert 



Vector 



m 



A5 



B2 



5825 bp Sequenced 
-2 kb Hind li! Fragment 



1 kb 



tezl+ 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 



mm ) urn mm i %m mm m m i fi ii 



mm 



introns 



2 J 4 



cDNA 
5-20|cDNA 





RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Bdnc 
RT-PCR w/ M2-B15 Bene 
RT-PCR w/ M2-B16Bcjnc 



Hind ill 



500 bp 
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Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 



5(C) 

D D Y L L I T 

ctg ctg atg gag gag tag tgg -5' 
a a aaaaa a a 
t t t t 
c c 
Polv 1 



FIGURE 35 




FIGURE 36 



PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_pl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 

* * * 



Q MC V G I P Q G 

caapkaa gtt ggt ate cct cag gg < Actual Genomic Sequence. 

LU t ■ t c 

t a-y a g c c teg 

cagiip|cc aaa gga att cca taa gg > 



ag.acc aaa 


gga 


att 


cca 


tea 


ggC 


TCA 


ATT 


CTG 


TCA 


TCT 


TTT 


TTG 


TGT 


CAT 


TTC 


TAT 


ATG 


tcf^gg ttt 


cct 


taa 


ggt 


agt 


ccG 


AGT 


TAA 


GAC 


AGT 


AGA 


AAA 


AAC 


ACA 


GTA 


AAG 


ATA 


TAC 


3 

ffi 


G 


I 


P 


S 


G 


S 


I 


L 


S 


S 


F 


L 


C 


H 


F 

< 


Y 


M 


GAA'^AT TTG 


ATT 


GAT 


GAA 


TAC 


CTA 


TCG 


TTT 


ACG 


AAA 


AAG 


AAA 


GGA 


TCA 


GTG 


TTG 


TTA 


CGA 


CTT CTA AAC 


TAA 


CTA 


CTT 


ATG 


GAT 


AGC 


AAA 


TGC 


TTT 


TTC 


TTT 


CCT 


AGT 


CAC 


AAC 


AAT 


GCT 


E D L 


I 


D 


E 


Y 


L 


S 


F 


T 


K 


K 


K 


G 


S 


V 


L 


L 


R 



GTA GTC gac gac 
CAT CAG ctg ctg 

V V D D 

< ctg ctg 

a a 



etc 


etc 


ate 


acc 


gag 


gag 


tag 


tgg 


L 


L 


I 


T 


gag 


gag 


tag 


tgg 


a a 


a a 


a 


a 


t 


t 


t 


t 


c 


c 







Polv 1 



gac gac ttc etc ttt ata aca < Actual Genomic Sequence. 

D D F L F I T 



FIGURE 37 



3* RT PGR Strategy 



mRNA 



5' 



IAAAAAAAAAAAAAAAA. 
TTTTTTTTTTTTTTT ■ 



3' 



3'- 



Qo 
Qi 



1 . Synthesis of cONA with Q| Primer. 
mRNA 



E*a5' 



IAAAAAAAAAAAAAAAA 

I t t t tt i" r r r i T TTrr ■■■■■ 



2 Wrst Round PCR Using Outside Primer and Qq Primer. 



I T TTTTTTTTTTT TT T 



€>0 



3L$econd Round PCR Using Inside Primer and Q\ Primer. 



Qi 



4. Sequence Second Round PCR Products Using inside Primer or Q| Primer. 
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-Size Selected Libraries from P. Nurese 
3~4kb 
5~6kb 
7~8kb 
11 - 12 kb 

-Libraries from J. A. Wise 
Sau 3a Partial Digest 
Hind Hi Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
C REP81ES Library (old) 
REP81ES Library (new) 
REP41ES Library 




6. gj 



Ol^inal PCR 




500 bp 
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A5 B2 




Hind III Digested Positive Genomic Clones 
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5' RT PCR Strategy 



mRNA 



5' 

ddA M ~P 



I AAAAAAAAAAAAAAAA. 



1 . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 

3* ■HBHMMI 



I AAAAAAAAAAAAAAAA . 



Sugate Oligo with 5*-P and blocked 3* to cDNA using T4 RNA Ligase. 




4. Second Round PCR 




FIGURE 41 



Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 

S.c. Est2p (366). WLFRQLI PKI IQTFFYCTEI 3STVT- IVYFRHDTW ...(35)... 

E.a. p!23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

* ***** *** 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tezlp AV I RLL PKK - - NTFRL I TN- LRKRF ...(61)... 
S.c. Est2p SKMRI I PKKSNNEFRI IAIPCRGAD ...(62)... 
N; E.a. pl23 GKLR1 I PKK- -TTFRPIMTFNKKIV ...(61)... 

o * * * * * * * 

0 Motif 3(A) AF 

m h hDh GY h 

W S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 

H S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 

W\ E.a. pl23 KLFFATMDIEKCYDSVNREKLSTFLK . . . (107) . . . 

ffi ***** * 

'.fx: 

* Motif 4(B') 

Q hPQG pP hh h 

H S.p. Tezlp YLQKVG I PQGS ILS SFLCHF YMEDL I DE YLSF ...(6)... 

y ; S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)... 

K; E.a. pl23 YKQTKG I PQGLCVSS ILSSFYYATLEESSLGF ...(14)... 



Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh h cK h 

S.p. Tezlp VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHnFSTSLEKTVINFENS .(205) 
S.c. Est2p LI LKLADDFLI ISTDQQQVINIKKLAMGGFQKYI\ T AKAT^RDKILAVSSQS .(173) 
E.a. pl23 L1MP.LTDDYLLITTQENNAVLFIEKLINVSRENGFKFMMKKLQTSFPLS . (209) 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 
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wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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wild type telomerase gene 



2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 
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telomerase gene::M 



Wild Type (X2) 



Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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FIGURE 44 



An Example of Confirmation of tezl disruption By PCR 
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FIGURE 45 



Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 



1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 

2 41 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1MTEHHTPKSRILRFLENQYV 20 

1019UTAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
21rJ LCTLNDYVQLVLRGS PASS 40 

1079i|5rAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
4ljhY SNICERLRSDVQTSFSIFL 60 

1139s^|aT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
Sim STVVGFDSKPDEGVQFSSP 80 
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FIGURE 46 (cont) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 
81 K C S Q S E Q6 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 
87L I ANVVKQMFDESFERRRNL 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 
107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 
114 NHEDFRAMHVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 DLVSTFPNYL I S I LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 
149 L L L E I I G 155 

1602 pGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 
15611 DAMHYLLSKGSIFEALPND 175 

1662 t^AT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 
176^ Y'LQISGIPL FKNNVFEETV 195 



1722 fi CA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

196 J KKRKRTIETSITQNKSARK 215 

1782 gAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 
216jl|VSWNSISISRFSIFYRSSY235 

m 

1842 p^AG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaatt ttag AT CTA TAT TTT AAC 

1907 51 

236 l k K F K Q D L Y F N 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 
246 L H S I C DR NT V H M W L Q W I F P R 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 Q FGLINAFQVKQLHKVIPLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLIEQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 A KRLHRISL S KVYNHYC PY I 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

326 D T HDDEKILSYSLKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

346 F L RS I LVRV F PK L I WGNQR I 365 

2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattt taccag AC CTC GAA ACT 2336 

366 F E I I L K D L E T 375 
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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 K R S NAKMC L S D F E KRK Q I FA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFI I PILQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2645 

446 I T ESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

466 LLCRPFITSMKMEAFEKINE 485 

27S gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

4§| NNVRMDTQKT 495 

|f] 

2im ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

494 TLPPAVIRLL PK KNTFRL IT 515 

B fX 

28® AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

5J6N L R K R F L I K 524 

2 9£& ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

5|5 MGSNKKMLVSTNQTLRPV 542 

2911 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

sS|a SILKHLINEESSGIPFNLE 562 

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 V YMKLL.TFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 K SCYDR IKQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 327 5 

612 K D PEFVI RKY AT I HATSD RA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 TKNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDFVDYWTKSSSEI F 679 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 

680 KMLKEHLSGHIVK 692 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQY LQKVGI PQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 

709 I LSSFLCHFYMEDLIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 FTKKKGSVLLRVVDDFLFI T 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 

749 VNKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA £GC AAG AAA 3900 

779 IN FENSNGI I NNTF FNESK K 798 

3 9 Op AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

79pR MPFFGFSVNMRSLDTLLAC 818 

396lflcCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

8lWP KIDEALFNSTSVELTKHMG 838 

402Sf| AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 
83EJK SFFYKILR S848 

4 09fipAGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

84ffeS LASFAQVF IDITHNSKFNS 868 

41SIHTGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 

86pC C NIYRLGY SMCMRAQAYLK 888 

42ltf AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttatt ttaactaga 4274 

889 RM KDIFIPQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA JGGA TAT ACG .AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 EI LGYTSRRF LSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

4 469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 PSFKYHPCFEQLIYQFQSLT 966 

4 529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 

967 D LIKPLRPVLRQVLFLHRRI 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 
987 A D * 989 
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4 666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct: 4745 

4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4 825 

4 826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4 906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 

4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 

5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 

5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 

5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 

5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 

5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 

5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 



FIGURE 47 



1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG atg agt gtg tac gtc gtc gag ctg ctc 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 

agg tct ttc ttt tat gtc acg gag acc acg ttt caa aag aac agg 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg' val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

• 120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 

GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 

TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
GAG GGG ATC CCG CAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro. 
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT" CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 

leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 
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leu gin thr val cys 
CTC CAG ACG GTG TGC 

400 

ala tyr arg phe his 
GCG TAC AGG TTT CAC 



gin val trp lys asn 
CAA GTT TGG AAG AAC 

430 

arg leu pro leu leu 
CGG CTC CCT CTG CTA 



y. val ala gly gly gin 
Q GTC GCT GGG GGC CAA 

ijj 460 

i*i arg ala val ala val 
* CGT GCA GTG GCT GTG 

^ thr pro cys his leu 

1, ACA CCG TGT CAC CTA 

D 

i; 490 

pro asp ala ala glu 
W CCA GAC GCA GCT GAG 

HI 

pro gly gly arg ser 
CCT GGA GGC CGC AGC 

520 

his pro gly leu met 
CAT CCT GGA CTG ATG 



thr pro ala ala leu 
ACA CCA GCA GCC CTG 

550 

ar 9 gly gly pro his 

AGG GGC GGC CCA CAC 



390 

thr asn ile tyr lys 
ACC AAC ATC TAC AAG 

ala cys val leu gin 
GCA TGT GTG CTG CAG 

420 

pro his phe ser cys 
CCA CAT TTT TCC TGC 

leu his pro glu ser 
CTC CAT CCT GAA AGC 

450 

gly arg arg arg pro 
GGG CGC CGC CGG CCC 



pro pro ser ile pro 
CCA CCA AGC ATT CCT 

480 

arg ala thr pro gly 
CGT GCC ACT CCT GGG 



ser glu ala pro gly 
TCG GAA GCT CCC GGG 

510 

gin pro gly thr ala 
CAA CCC GGC ACT GCC 



ala thr arg pro gin 
GCC ACC CGC CCA CAG 

540 

ser arg arg ala tyr 
TCA CGC CGG GCT TAT 



pro gly leu his arg 
CCA GGC CTG CAC CGC 



ile leu leu leu gin 
ATC CTC CTG CTG CAG 

410 

leu pro phe his gin 
CTC CCA TTT CAT CAG 



ala ser ser leu thr 
GCG TCA TCT CTG ACA 

440 

gin glu arg arg asp 
CAA GAA' CGC AGG .GAT 



ser ala leu arg gly 
TCT GCC CTC CGA GGC 

470 

ala gin ala asp ser 
GCT CAA GCT GAC TCG 



val thr gin asp ser 
GTC ACT CAG GAC AGC 

500 

asp asp ala asp cys 
GAC GAC GCT GAC TGC 

leu arg leu gin asp 
CTC AGA CTT CAA GAC 

530 

pro gly arg glu gin 
CCA GGC CGA GAG CAG 



thr ser gin gly gly 
ACG TCC CAG GGA GGG 

560 

trp glu ser glu ala 
TGG GAG TCT GAG GCC 



564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 



% 
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Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhhK hRh R 

Ep p 1 23 . . . KS LGF APGKLRLIPKKT— TFRPIMTFNKKI V. . . 

Sp Tezl ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG... 
consensus R PK RI 

AF 

U Motif A h hDh GY h 

h Eppl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

S SpTezl ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

.in ScEst2 . . . PEL YFMKFD VKSC YDSIPRMECMRILK. . . 

Hi HsTCPl ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 
T\ consensus F D YD 

% Motif B hPQG pS hh 

u Eppl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

L: SpTezl .. . GNSQ YLQK VGIPQGSILSSFLCHF YME. . 

H Sc Est2 . . . EDKCYIREDGLFQGSSLS APIVDL V YD . . . 

fj HsTCPl ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

O Y 

fu Motif C h FDDhhh 

Ep p 1 23 ...PNVNLLMRLTDDYLLITTQENN... 

SpTezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

Hs TCP! . . . RRDGLLLRL VDDFLL VTPHLTH. . . 
consensus DD L 

Motif D GhhcK 

Eppl23 ...NVSRENGFKFNMKKL... 

SpTezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

Hs TCP 1 . . . LRTL VRG VPE YGC V V . . . 
consensus G 
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FIGURE 50 



1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
30 1 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
35 1 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGC ACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
55 1 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
60 1 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGG AG GGTGCGCTCT CTGGC ACGCG CC ACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGG ACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
1051 TCCTC AGGCG AC A AGN AC AC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1 101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
1151 TTCCAGGCCT TGG ATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCC A 
1 20 1 GCGNTACTGG C AAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
1 301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1 40 1 CTGCTCCGCC AGC ACAGC AG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGC ACAACG 
1 501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 
1 55 1 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCC AAGT TCCTGCACTG GCTGATGAGT 
1 70 1 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCA AGT 
1 801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1 85 1 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1 90 1 GACGTCCAG A CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCG ATTG 
1951 TGAACATGGA CTACGTCGTG GGAGCCAGA A CGTTCCGCAG AGAAAAG AGG 
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCC A CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 G ACCCGCCGC CTG AGCTGTA CTTTGTC AAG GTGG ATGTG A CGGGCGCGTA 
220 1 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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230 1 GCCCATGGGC ACGTCCGCAA GGCCTTC AAG AGCCACGTCT CTACCTTGAC 
235 1 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
245 1 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
255 1 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
260 1 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TC AGGACCCT GGTCCGAGGT GTCCCTG AGT ATGGCTGCGT GGTG AACTTG 
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCCA TCAGAGCCAG TCTC ACCTTC AACCGCGGCT TC AAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
300 1 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTC ATCAGC AAGTTTGGAA GAACCCC AC A TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT G AAAGCC AAG 
320 1 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
335 1 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
340 1 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
345 1 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
350 1 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
355 1 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
375 1 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
380 1 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
390 1 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG GGG GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly aso pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 6C 
phe arg ala leu val ala gin cys leu val cys val pro tro asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

30 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG- GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG A^C GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

no 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

T 



.30 



5£ PfS ^ ^ thr asp ala leu ar 9 gly ser gly ala 
^ CC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGG CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gin val cys gly pro 
GCC TAC CAG GTG TGC GGG CCG 



thr gin ala arg pro pro pro 
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 



val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 



150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGC- CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 

280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 
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290 

giy ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 



gin his his ala gly pro pro 
CAG CAC CAC GCG GGC CCC CCA 

320 

trp asp chr pro cys pro pro 
TGG GAG ACG CCT TGT CCC CCG 



leu tyr ser ser gly asp lys 
CTC TAC TCC TCA GGC GAC AAG 

350 

leu ser ser leu arg pro ser 
CTC AGC TCT CTG AGG CCC AGC 



glu thr iie phe leu gly ser 
GAG ACC ATC TTT CTG GGT TCC 

380 

arg arg leu pro arg leu pro 
CGC AGG TTG CCC CGC CTG CCC 

leu phe leu glu leu leu gly 
CTG TTT CTG GAG CTG CTT GGG 

410 

val leu leu lys thr his cvs 
GTG CTC CTC AAG ACG CAC TGC 



ala ala gly val cys ala arg 
GCA GCC GGT GTC TGT GCC CGG 



300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAC GCC GAG ACC AAG CAC TTC 

340 

glu gin leu arg pro ser phe leu 
GAG CAG CTG CGG CCC TCC TTC CTA 

360 

leu thr gly ala arg arg leu val 
CTG ACT GGC GOT CGG AGG CTC GTG 

370 

arg pro trp met pro gly thr pro 
AGG CCC TGG ATG CCA GGG ACT CCC 

390 

gin arg tyr trp gin met arg pro 
CAG CGC TAC TGG CAA ATG CGG CCC 

400 

asn his ala gin cys pro tyr gly 
AAC CAC GCG GAG TGC CCC TAC GGG 

420 

pro leu arg ala ala val thr pro 
CCG CTG CGA GCT GCG GTC ACC CCA 

430 

glu lys pro gin gly ser val ala 
GAG AAG CCC CAG GGC TCT GTG GCG 
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440 450 
ala pre glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC GTG GTG CAG CTG 

460 

leu arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro giv leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu giy iys his ala lys leu ser leu gin glu leu thr tnp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg ara ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

• 530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

, 590 600 

He gly ile arg gin his leu lys arg val crin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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610 

ser glu ala glu val arg gin his arg glu ala arg pre ala leu 
TCG GAA GCA GAG GTC AGG GAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG OCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe ara 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg ara pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 630 
gly ala ser val leu gly leu asp asp ile his arg ala tro arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tvr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tvr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 
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770 780 
thr asp ±eu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser cer 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

S00 810 
ser ieu asn glu ala ser ser gly leu phe asp val ohe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly iys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile ieu ser t*r leu l <=u 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser ieu cys tyr gly asp met glu asn iys ieu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 S70 
ile arg arg asp gly leu leu leu arg leu val aso asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 

880 

leu val thr pro his leu thr his ala iys thr ohe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu aia leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

' 920 930 

thr ala phe val gin met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
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940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC rTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 9go 
val leu arg leu lys cys his ser leu phe leu aso leu aln val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu cln leu p-o phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC AC A TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly a ia ala glv pro leu oro 
GCA GGC- ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 



1110 



arg thr a la gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 



leu thr ala leu glu ala ala ala asn pro ala leu pro ser aso 
CTG AC? GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 



TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAC-CAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGA^-GGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAA.aTTGGGGGGAGGTGC 

TGTGC^AGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAA^ 
AAAAAAAAA* 



1120 



1130 1132 
phe lys thr ile leu asp OP 
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